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Abstract 

The problem of forward abstract interpretation of normal logic programs has not been 
formally addressed in the literature although negation as failure is dealt with through the 
built-in predicate ! in the way it is implemented in Prolog. This paper proposes a solution 
to this problem by deriving two generic fixed-point abstract semantics Fp and Fp for 



forward abstract interpretation of normal logic programs 

F P is intended for inferring data descriptions for edges in the program graph where an 
edge denotes the possibility that the control of execution transfers from its source program 
point to its destination program point. Fp is derived from F P and is intended for inferring 
data descriptions for textual program points 



1 Introduction 

Abstract interpretation Q is a program analysis methodology for statically deriving run-time 
properties of programs. The derived program properties are then used by other program pro- 
cessors such as compilers, partial evaluators, etc. Program analyses are viewed as program 
executions over non-standard data domains. Cousot and Cousot first laid solid mathematical 
foundations for abstract interpretation J?], |). The idea is to define a collecting semantics for 
a program which associates with each program point the set of the storage states that are 
obtained whenever the execution reaches the point. Then an approximation of the collecting 
semantics is calculated by simulating over a non-standard data domain the computation of the 
collecting semantics over the standard data domain. The standard data domain is called the 
concrete domain and the non-standard domain is called the abstract domain. 

There has been recently much research into abstract interpretation of logic programs 
Abstract interpretation has been used in both forward and backward analyses of logic programs. 
A forward analysis || approximates the set of substitutions that might occur at a program point 
given a program and a set of goal descriptions. A backward analysis fi |(| |2jJ approximates 

"This draft is a reformulation of Chapter 4 in 
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the set of the atoms that are logical consequences of a program [42 . A number of generic ab- 
stract semantics, often called frameworks, schemes ||, 18, pOl, have been proposed for forward 



abstract interpretation of logic programs. These generic abstract semantics have been spe- 
cialised for the detection of determinacy pTcj], data dependency analyses || [ll], 14, 15, 32, 33 1 , 



mode inference [[|, p|, O, p6 , 32, 41] , program transformation p?| , type inference |4 , 13, 16, 26), 
termination proof |43] 7etc. However, these generic abstract semantics have been developed for 
forward abstract interpretation of definite logic programs. The problem of forward abstract 
interpretation of normal logic programs has not been formally addressed in the literature al- 
though negation as failure is dealt with in practice through the built-in predicate ! in the way 
it is implemented in Prolog. This paper proposes a solution to this problem by deriving two 
generic fixed-point abstract semantics F P and Fp for forward abstract interpretation of normal 
logic programs without relying on any capability of dealing with the built-in predicate !. 

Existing generic abstract semantics in the literature are optimisation-oriented and they are 
used to infer data descriptions for textual program points. However, there are some applications 
such as debugging where it is helpful to infer data descriptions for edges p<-»q in the program 
graph where p<-»<? denotes that the control of execution may transfers from program point q to 
program point p. Fp is intended for these applications. Fp is derived from Fp and is intended 
for inferring data descriptions for textual program points. 

The way in which Fp and Fp are derived is conventional. Fp is based on a fixed-point 
collecting semantics that associates a set of substitutions with each edge p^-»q in the program 
graph. The set of substitutions associated with p^-»q includes all the substitutions at program 
point p whenever the control of execution transfers from program point q to program point p. 
The collecting semantics is obtained through two approximations. The operational semantics 
SLDNF-resolution via the left-to-right computation rule is first approximated by a transition 
system. The transition system is then approximated by the collecting semantics. Obtained 
from the collecting semantics by a further approximation is F p that can then be specialised to 
perform various analyses under certain sufficient conditions. Fp is derived from F p through 
one more approximation and it can be specialised to perform various analyses under the same 
sufficient conditions as Fp. 

This paper makes two contributions. Firstly, Fp and Fp can be used to perform forward 
abstract interpretation of normal logic programs while the existing generic abstract semantics 
in the literature can only be used to perform forward abstract interpretation of definite logic 
programs. A common practice in analysing logic programs with negation as failure is to extend 
an existing generic abstract semantics with capability of dealing with the built-in predicate ! 
and then analyse negation as failure in the way it is implemented in Prolog. However, the 
built-in predicate ! is not a part of the language of normal logic programs. The derivation of 
Fp and Fp does not resort to any capability of dealing with any built-in predicate. Secondly, 
F P is easier to specialise for inferring data descriptions for edges in the program graph than 
the existing generic abstract semantics in the literature. 

The remainder of this paper is organised as follows. Section ^ briefly recalls on mathematical 
foundations for abstract interpretation and some terminology in logic program, and introduces 
some notations used later in this paper. Section ^reformulates SLDNF in order to facilitate 
the derivation of the collecting semantics. SectionH derives the collecting semantics from the 
operational semantics. Section [| derives Fp from the collecting semantics through a further 
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approximation, and gives the sufficient conditions for F P to safely approximate the collecting 
semantics, and analyses its worst case complexity. Section [j] derives Fp from F P through one 
more approximation, and analyses its worst case complexity. In section ^, we show how F P 
and Fp can be specialised to infer groundness information. Section ^| reviews related work on 
forward abstract interpretation of logic programs. Section ^| concludes the paper. 

2 Preliminaries 

2.1 Complete lattice 

Let S, Si, S2 be sets. The powerset p(S) of S is the set of subsets of S. p(S) = {X ICS}. 
The Cartesian product Si x S2 of Si and S2 is the set of the tuples with the first components 
in Si and the second components in S2. Si x S2 = {< Si, S2 > | si G Si A S2 G S2}. 

A binary relation R on S is a subset of Sx S. < x, y >G R is denoted as xRy and < x,y >G' R 
is denoted as x$iy. R is reflexive iff xRx for every x G S. R is transitive iff for every x,y, z G S, 
xRy and yRz implies xRz. R is anti- symmetric if, for every x, y £ S, xRy and yi?x implies 
x = y. 

A partial order Z on S 1 is a reflexive, anti-symmetric and transitive relation on 5. A poset 
< S, Z> is tuple where S* is a set and Z is a partial order on 5. 

Let < S, Z> be a poset, X C 5 and u,v € 5. it is an upper bound of A if a; Z it for 
every x G A. An upper bound u of A is the least upper bound of A if it Z v for every other 
upper bound v of A. The least upper bound of A is unique if it exists and is denoted as UA. 
U{xi, X2, ■ ■ ■ , Xk} is sometimes written as xi U X2 U • • • U x^. U{x|-P(x)} is sometimes written as 
Lipr x -\X. Similarly, it is a lower bound of A if u Z x for every x G A. A lower bound u of A is 
the greatest lower bound of A if u Z u for every other lower bound v of A. The greatest lower 
bound of A is unique if it exists and is denoted as ZiA. n{xi, X2, . . . , x^} is sometimes written 
as Xi n X2 n • • ■ n xk- n{x|P(x)} is sometimes written as r\ P i x \x. 

Let < S, Z> be a poset. _L G S is an infimum of < S, Z> if _L Z x for every x G S. Not 
every poset has an infimum. A poset has a unique infimum when it has one. A supremum T 
of < S, Z> is defined dually. 

A complete lattice < S, Z> is a poset such that every ICS has a least upper bound and 
a greatest lower bound. A complete lattice has a unique infimum and a unique supremum. We 
will write a complete lattice < S, Z> as < S, Z, _L, T, n, U > when it is necessary to make the 
infimum _L, the supremum T, the greatest lower bound operator n and the lease upper bound 
operator U explicit. 

Let D and D be sets. D 1— » D denotes the set of total functions from D to D. A total 
function f from D to D is a subset of D x D such that, for every d G D, there is one and only 
one d G D such that < d,d >G /. < d, d >G / is denoted as d = f(d). 

Let / G D D and g G -D 1— > 13. We use g ■ f to denote the composition of two functions 

f and g ■ f = J Ax G D.g(f(x)). 

Let < L>, Z, _L, T, n, U > and < D, Z, _L, T, U > be complete lattices, and / G D h-> D. f 
is monotonic if /(x) Z /(y) for any x,y G 13 such that x Z y. 
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Let < D, C> be a complete lattice and / G D D. i e D is a fixed-point of / if x = f(x). 
x is the least fixed-point, denoted by Ifpf, of / if x Q y for each fixed-point y of /. Ifpf = f f (3 
for some ordinal (3 where 

t a def ( □{/ t /3' I /?' < /?} if (3 is a limit ordinal 

\ /(/ T (/? — 1)) if (3 is a successor ordinal 

2.2 Abstract Interpretation 

We now formalise the notion of abstract interpretation according to the ideas given by [Q. 
The idea of having an element in an abstract domain < D, £ > as description of a element 
in a concrete domain < D, C> is formalised by a monotonic function from D to D, called a 
concretisation function. 

We say that an element d in D is approximated by an element <i in D if d £ l(d). There 
might well be a number of elements that approximate d. If 7 G D h- > £) and f £ I) h D are 
concretisation functions then 7 • 7' G I? 1— > I? is a concretisation function. 

The notion of approximation can also be formalised by means of an abstraction function 
from the concrete domain to the abstract domain, or a Galois connection between the ab- 
stract domain and the concrete domain or a relation between the concrete and the abstract 
domains p7f . 

A fixed-point interpretation of a program is the least fixed-point of a function associated 
with the program on a semantic domain, often a complete lattice. The following theorem shows 
how the least fixed-point of a monotonic function on one complete lattice can be approximated 
by the least fixed-point of another monotonic function on another complete lattice. 

Theorem 2.1 If < D, C> and <£>,£> are complete lattices, F a monotonic function on 
< D, F a monotonic function on <£>,£>, 7 a monotonic function from D to D and 
VJ G D.(F ■ 7(d) C 7 • F{d)) then IfpF C j(lfpF). 

Proof: See & I 



2.3 Logic programming 

We assume that the reader is familiar with the terminology in logic programming p4| ] . Let C be 
a first order language with function symbol set S and predicate symbol set II which is disjoint 
from S. Let MAR. be a denumerable set of variables and V C VATZ. TERM(E, V) denotes the set 
of terms that can be built from E and V. ATOM(II, E, V) denotes the set of atoms constructible 
from IT, E and V. The negation of an atom A is denoted as -*A. A literal is either an atom or 
the negation of an atom. 

Let 9 and a be substitutions, a o 9 denotes the composition of a and 9. dom(9) denotes 

def 

the domain of 9. Define Sub = {9 \ 9 is a substitution}. An expression is a term, an 
atom, a literal, a clause, a goal etc. The set of variables in an expression E is denoted as 
vars(E). For an expression E and a substitution 9, E9 denotes the instance of E under 9. An 
expression E' is an instance of another expression E if E' = E9 for some substitution 9 where 
A = B denotes that A is syntactically identical to B. Let 9 be a substitution and V C VAJZ. 
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9 | V = {X/t e 9 \ X <EV} is the restriction of to V. The convention is that o binds stronger 
than |. For instance, -q o a \ V is equal to (n o a) f V. 

Two substitutions cr and 9 are equivalent modulo renaming if there are two renamings S and 
p such that a = 9oS and = a op. We write cr = 9 to denote that a and 6* are equivalent modulo 
renaming. We will not distinguish those substitutions that are equivalent modulo renaming. = 
is naturally extended to expressions. Let E\ and E 2 be two expressions. E\ = E2 if there are 
two renamings <5 and p such that E\ = E 2 S and E 2 = E\p. 

An equation is a formula of the form I = r where I and r are terms or atoms. The set of 
equations is denoted as Eq. Let E G p(Eq). E is in solved form if, for each equation I = r in 
E, I is a variable and / does not occur in the right hand side of any equation in E. There is 
a natural bijection between substitutions and the sets of equations in solved form. Therefore, 
we sometimes write a substitution as a set of equations in solved form. The unification of a 
set of equations is decidable and the most general unifiers for a set of equations are equivalent 
modulo renaming. Let mgu be the function from p(Eq) to {fail} U (WVZ TERM(S, VAR,)) 
that, given a set of equations E, either returns a most general unifier for E if E is unifiable or 
returns fail otherwise. mgu({l = r}) is sometimes written as mgu(l,r). 

A normal clause is a formula of the form H <— L\, L2, ■ ■ ■ , L n where H is an atom, Li for 
each 1 < i < n is a literal. H is called the head of the clause and L\, L 2 , ■ ■ • , L n the body of the 
clause. A normal goal is a formula of the form <— L\, L 2 , ■ ■ ■ , L n with Li for each 1 < i < n 
being a literal. A normal program is a set {d \ 1 G He} of normal clauses where Kg is a finite 
set of distinct natural numbers. Let m[i] denote the number of the literals in the body of clause 
C l . We write C l as H t <- £( 8 ,2), ■ ■ ■,i(,,m[.])- 

A gtter?/ to a program is a goal that initiates the execution of that program. There might 
be infinite number of possible queries that a program is intended to respond to. For the time 
being, we denote the set of all possible queries as {Gk<9k \ k G Kg} where Kg is a finite set of 
distinct natural numbers such that Kg n Kg = 0- Gk for each k G Kg is a normal goal and 6fc 
is a set of substitutions 9k- Each Gk9k with 9k G 8/t is a query. Let m[k] be the number of 
literals in G k . We write G k as <- L (fc;1) , L (fei2 ), • • • ,i(fe,m[fe])- 

Let K = Kg U Kg- Let Pi refer to G\ for z G Kg and to refer to Gi for i G Kg and define 
Vi d = vars(Pi). 

Let i G K. We designate Pi with + 1 program points, point (i, j) immediately before 

def 

L(i^ for 1 < j < m[i] and point (i,m[i] + 1) immediately after Lu^ m uv\. entry(i) = (i, 1) is 

def 

called the entry point of Pi and exit(i) = (i, m[i] + 1) the exit point of Pi. 

We denote by Mp the set of all program points designated with Pi for all i G K. Let 
p = (i,j) be a program point. p[l] = i denotes the index to the clause or the query to which p 
belongs. p[2] = j denotes the position of p in the clause or the query. So, p = (p[l],p[2)). 
We define two partial functions Xp.p + and Xp.p- over the set of all the program points. 
p + = (p[l],p[2] + 1) is defined for each p such that p[2] < m[p[l]] and 
p- = (p[l],p[2] — 1) is defined for each p satisfying 2 < p[2] < m[p[l]] + 1. p + is the pro- 
gram point to the right of p if p + exists and p- is the program point to the left of p if p- 
exists. 

Let p G Afp. B p denotes the atom in literal L p . If L p is positive then L p = B p . If L p is 
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negative then L p = ->B P . 

Example 2.2 The following normal logic program will be used in several examples in this 
paper. The meaning of member(X, L) is that X is a member of list L. The meaning of 
diff(X, L, K) is that either X is a member of list L or X is a member of list K but X is not 
both a member of L and a member of K . 

C\ = dif f{X, L, K) <— member(X, L), -^member(X, K) 

C2 = dif f(X, L, K) <~ member{X , K), ->member(X, L) 

C3 = member [X, [X\L]) <— 

C4 = member (X, [H \L]) <— member (X, L) 

Suppose that the set of queries is described by {6*585} with G5 =<— diff(X,Y, Z) and 65 is 
the set of substitutions 9 such that XO is a variable, and both L9 and K9 are ground terms. 
Then, Nc = {1, 2, 3, 4} and Kg = {5}. Afp contains 11 program points. £n,i) = member(X, L) 
and L (li2 ) = -mem&er(X, if). V X = {X, L, K} and V 5 = {X, Y, Z}. I 



3 Operational semantics 

The operational semantics we consider is the SLDNF-resolution via the left to right computation 
rule. We shall not mention the computation rule explicitly. When the set of all the descendant 
goals of a set of queries is to be computed, it is necessary to resolve the current goal with every 
clause whose head unifies with the selected atom in the current goal. Therefore, the selection 
rule for choosing a particular clause to resolve with the current goal is not of interest. ^ uses 
general SLD-resolution as operational semantics of definite logic programs and does not take 
the computation rule into account. This usually leads to less precise analyses JL6|. 



3.1 SLDNF 

We now briefly recall on SLDNF-resolution (SLDNF in short). The renamed literals will be 
written in the form Lp where L is a literal in a clause or a query and p a renaming used in 
standardisation apart 0. 

First consider SLD-resolution (SLD in short) for definite logic programs where every literal 
is positive. For a query and a definite program, SLD works by repeatedly resolving the current 
goal, initially the query, with a clause in the program. In one resolution step, SLD nondeter- 
ministically selects a clause in the program, renames the clause so that it does not have any 
common variable with the current goal, and derives a new goal by replacing the leftmost literal 
in the current goal with the body of the renamed clause and applying to the resultant the most 
general unifier of the head of the renamed clause and the leftmost literal. 

If SLD has derived an empty goal, it has successfully computed a computed answer substi- 
tution to the query. The computed answer substitution is the restriction, to the variables in 
the query, of the composition of the most general unifiers in the derivation from the query to 
the empty goal. 

With SLD, only positive information can be derived from a program. SLDNF uses the 
negation as failure rule to derive negative information. SLDNF deals with positive literals in 
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the same way as SLD. Suppose that the leftmost literal in the current goal is negative. SLDNF 
first recursively invokes itself with the leftmost literal as the query of the recursive invocation. 
If the recursive invocation fails then SLDNF removes the leftmost literal from the current goal 
and continues with the resultant as the new current goal. Otherwise, according to the rule for 
the weak safe uses of negation as failure 0] , SLDNF either backtracks if the computed answer 
substitution returned by the recursive invocation is a renaming or otherwise flounders. 
Let the current goal be 

«- (L(i it k)Pj, L(i jik+1 )pj, ■ ■ •,L(j 3 . )m [ ij ])/5j, • • -)T(j t k) (m) 

where pj is the renaming used to rename clause C^. If L(i k ) = ~~'B(i j ,k) then SLDNF recur- 
sively invokes itself with <— Bu.^tq^) as the query of the recursive invocation. 

LetL(j iifc ) =B( ljk y Ifthere is a clause Ci i+1 = H ij+1 <— L( ij+Li2 ), ■ ■ ■ , 

in the program and a renaming pj+\ such that 

vars(C ij+1 p j+1 ) n vars{{L^. ^pj, L^. tk+1 )Pj, ■ ■ O^O'.fe)) = (!) 
and B(i. t k)pjT(j,k) an d Hi j+1 pj + i unify, then the new current goal becomes 

<- (L(i j+1 ,i)Pj+i, ■ ■ ■ , L(i j+um [i j+1 ]-)pj + i, L( ijik+l )pj, ■ ■ -) T 0+i,i) ( R2 ) 
where 

T (j+i,i) = T U,k) °V (2) 

and 

r) = mguiHi^pj^BfatfPjTytf) (3) 
Suppose that there is a sub-refutation of 

(L(i j+u i)Pj+i,L(i j+u 2)pj+i, ■ ■ ■ ,i(i i+1 , m [i J+1 ])/3i+i)r( J+ i.i) 

with the composition of the most general unifiers used in the sub-refutation being 9. Then the 
next current goal immediately after the sub-refutation is 

*- {L(i j ,k+i)Pj,L(i :l! h+2)Pj,- ■ ■ , LfaMiifiPi'' ' ') T (i,fc+i) (R3) 
where 

T(j,k+l) = T U+i,i) ° = T (j,k) ( 4 ) 



3.2 VSLDNF 

We now propose a variant of SLDNF (VSLDNF in abbreviation) as the operational semantics. 
VSLDNF is equivalent to SLDNF in the sense that, given the same goal and the same program, 
VSLDNF reaches a program point iff SLDNF reaches the same program point, and the instan- 
tiation of the variables in the clause of the program point by VSLDNF is equivalent (modulo 
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renaming) to that by SLDNF. We now formulate VSLDNF and then establish the equivalence 
between SLDNF and VSLDNF. 
Let the current goal be 

<— (£(y,k)>-k(y,k+l)) • ' ' i" C '(ij,mfe])) cr (j,fc) (Rl') 

If = "iBjj then VSLDNF recursively invokes itself with <— B^. : k) a (j,k) being the 

query of the recursive invocation. 

Let £(i ,fe) = B^.j-y The derivation is suspended and a sub-derivation is started as follows. 
If there is a clause C ij+1 = H ij+1 <- L(^ +1 , 2 ), • • • , £(i j+1 , m [ 4j+1 ]) in the program and a 

renaming V'j+i such that 

V ij+1 n vars(L^ ik) a(j^ip j+ i) = (5) 
and Bu.^\<jij t f.\tj)j^.i and Hi j+1 unify then the next current goal becomes 

ife+i,!)) 1 '^!^).-- ' ,i(i j+1 ,m[i ) ' +1 ]))^'+l,l] (R- 2 ') 

where 

°"(j'+x,i) = m 9 u { B {ij,k)^( .k)^]+i,H i]+l ) (6) 

We call the step to derive a new goal from the current goal and a clause in the program 
procedure- entry. 

Suppose that there is a sub-refutation of 

and C(j+i, m [i +1 ]+i) be the substitution immediately after the sub-refutation. Then the sus- 
pended derivation is resumed and the new current goal becomes 

<- (£(ij,fe+l)) • • • (-kfo.mfottV&fe+l) ( R3 ') 
where 

o"(j,fe+i) = °"(j,fc) ™3-"(B(. ij!fe) CT( jife ),i/i j+1 CT( j+ i !m [ i3+1 ] + i)(?!)j + i) (7) 
and (^j+i is a renaming such that 

vars(H ij+1 o-( j+ltm[i . +l]+1 )<pj + i) n vars(C tj cr^ k ) = (8) 

We call the step to derive a new goal from a suspended goal and a completed sub-derivation 

Lemma 3.1 VSLDNF is equivalent to SLDNF in the sense that, given the same goal and 
the same program, VSLDNF reaches a program point iff SLDNF reaches the same program 
point, and the instantiation of the variables in the clause of the program point by VSLDNF is 
equivalent (modulo renaming) to that by SLDNF. 



Proof: See p. 32 
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Example 3.2 This example illustrates VSLDNF. Let the program be that in example 2.2 and 
<— diff(X, [2, 1], [3, 1]) be the query. Let C(o,i) = 0- VSLDNF begins with the following current 
goal. 

<-di//(jr,[2,l],[3,l])£r (0) i ) (GO) 

Let Vi = {X/X^ and C n = C x . Then 

<r (1)1) = mgu(diff(X,[2,l],[3,l])a {0A) ?p u diff(X,L,K)) 
= {Xi/X,L/[2,1],K/[3,1]} 

VSLDNF suspends goal (GO), performs a procedure-entry, and derives the following goal. 
<— (member(X, L),-imember(X, K))(j(i i) (Gl) 
Let ^2 = {X/X 2 } and C J2 = C 3 . Then 

(T(2i) = mgu{member(X, L)an i)ip2, member(X, [X\L])) 
= [X 2 /2,X/2,L/[1]} 

VSLDNF suspends goal (Gl), performs a procedure-entry, and derives the following empty goal. 
□<t C2>1) (G2) 
VSLDNF performs a procedure-exit step and derives the following from (Gl). 
<— -imember(X, K))<j(i.2) (G3) 

where, letting 2 = 0, 

""(l 2) = C(i l) mgu(member(X , L)(7( 1 ^, member(X, {X\L))<j(2 i)4>2) 
= {x\/2,X/2,L/[2,l],K/[3,l]}' 

The leftmost literal of (G3) is negative, VSLDNF invokes itself recursively with 
i— member (X, K)) a ( 12 ) and fails to refute it. So, by the negation as failure rule, VSLDNF 
derives the following goal with 07i,3) = C(i,2)- 

□cr ci>3) (G4) 

This finishes a sub-refutation of (Gl). VSLDNF performs a procedure-exit step and derives 
the following from (GO). 

□<7 C0>2) (G5) 

where, letting <j>\ = 0, 
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C(o,2) = cr (0l i) o mgu(diff(X, [2, 1], [3, l])<7(o,i), diff(X, L, if )er(i,3)<M 
= {X/2} 



This finishes a refutation of (GO). So, er( .2) = {^/2} is a computed answer substitution of 



VSLDNF differs from SLDNF in several ways. Firstly, a goal in VSLDNF is a part of a clause 
or a query, in particular, it is a tail of a clause or a query. This helps to approximate VSLDNF 
as a transition system later. Secondly, when VSLDNF derives a new goal from the current goal 
and a clause, it renames the leftmost literal in the goal instead of the clause. This is to ensure 
that the domain of the substitution that will be applied to the body of the clause contains 
variables in the clause instead of their renamed counterparts. Thirdly, when a sub-refutation is 
finished, an extra renaming and an extra unification are needed for VSLDNF to calculate the 
substitution immediately after the sub-refutation whilst these extra operations are not needed 
in SLDNF. Note that VSLDNF is only used in formulating the collecting semantics. 

3.3 Program graph 

Let p,q E Afp, and q be the most recent program point that VSLDNF has reached. There 
are several possibilities that VSLDNF will reach p next. If q is the exit point of a clause then 
the only way that VSLDNF can reach p immediately is to perform a procedure-exit. This 
can happen only if L p - is positive and that program clause has been used to resolve with 
L p -. If q is not the exit point of a clause then VSLDNF may reach p immediately either by 
performing a procedure-entry or by applying the negation as failure rule. VSLDNF may reach 
p immediately by applying negation as failure rule if L q = ^B q hq = p~. VSLDNF may reach p 
immediately by performing a procedure-entry either directly when L q is positive or indirectly 
when L q is negative. Note that if q is the exit point of a query then VSLDNF has succeeded 
and will not visit any more program points. In order to facilitate further presentation, we 
assume that VSLDNF starts at a dummy program point (0, 0) ^ Mp from where it can reach 
entry points of goal clauses by doing nothing. Therefore, there are four ways that VSLDNF 
will reach p immediately after it has reached q. We use a graph < J\fp ,£p >, called program 
graph, to represent the relation among program points p, q that "VSLDNF will possibly visit p 
immediately after it has visited q. The set Afp~ of nodes in the program graph is A/p U {(0, 0)} 
and each edge p<-»<7 in £ P in the program graph denotes that VSLDNF will possibly visit p 
immediately after it has visited q. Formally, £p is inductively defined as follows. 



^//(V, [2,1], [3,1]). □ 




U 3. 



0<]<3 



C P — 



{entry (k) <-»(0,0) | k £ K G } 
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eh 



Cp 
Cp 



def 



q[2] < m[q[l}} 
A i G K c 

entry(i)^-9q / p is a renaming 

A Bp. I A vars(B q p) rivars(Hi) = $ 
\ A mgu(B q p, Hi) ^ fail 

= {p^«ea;ii(i) | entry(i)^*p- >G £p A L p - = B p -} 

d = {p<-*P~ I L p - = ^B p -} 



£ P is the collection of edges from dummy program point (0, 0) to the entry points of queries. 
Edges in £p correspond to procedure-entries, edges in £p to procedure-exits, and edges in £p 
to the negation as failure rule. £ l P (~l £p = for any < i, j < 3 such that i ^ j. 

Example 3.3 Consider the program in example |2.2| . 

5 e K G . Hence, (5, l)<-»(0, 0) G £ P . Let p = {X/X ,L/L }. 
mgu{B(i t i)p,H^) = {X /X,L /[X\L}} 7^ fail. So, (3, 1)<-»(1, 1) G £p. Since i(i.i) is positive, 
(1,2)^.(3,1) e£ P . Let 6= {X/X ,K/K }. mgu{B (1 . 2) S, H 3 ) = {X /X, K /[X\L}} ± fail. So, 
(3, 1)<-»(1, 2) e £p. Since £(i,2) is negative, (1, 3)<-»(l, 2) G £p- 

There are 23 edges in the program graph for the program. □ 



4 Collecting semantics 

In this section, we present the fixed-point collecting semantics for normal programs. The 
collecting semantics of normal program P is lfpF P where F p is defined below. lfpF p associates 



a set of substitutions with each edge p<-9q G £p. Sub-section 4.1 uses a transition system to 
approximate VSLDNF. A state in this transition system corresponds to a goal in VSLDNF. The 
set of states derivable from a set of initial states by the transition system is then characterised 
as the least fixed-point lfpFp of a function Fp mapping a set of states into another set of states. 
Therefore, the set of goals derivable from a set of initial goals by VSLDNF is approximated 



by lfpFp. Sub-section 4.2 derives the fixed-point collecting semantic function F p from Fp and 



proves that lfpF p is a safe approximation of lfpFp. 
Let A, B be atoms, and 9,uj G Sub. Define 

let p be a renaming such that vars(Adp) D vars(Bui) = 0, 

.mifuiA 8 B cj) = f J lf m 9<A0p,Bu) £ fail 

•' ' » then u o mgu(A9p, Buj) W 

else fail 

Although there are infinite number of renamings p satisfying vars{A9p) n vars(Buj) = in 
equation ^|, only one renaming must be considered when computing unify(A, 6, B,lS) because 
unify{A,9,B,uj) for one renaming is equivalent (modulo renaming) to unify(A,9, B,ui) for 



another renaming accoding to lemma 10.2 
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4.1 Approximating VSLDNF by a transition system 

We now devise a transition system to approximate VSLDNF. A state in the transition system 
corresponds to a goal in VSLDNF. The transition system approximates VSLDNF in the sense 
that, if a goal is derivable from an initial goal by VSLDNF then the state corresponding to the 
goal is derivable from the state corresponding to the initial goal by the transition system while 
the reverse is not necessarily true. 

A state in the transition system is a stack that is a sequence of stack items. The empty 
stack is denoted as $. A stack item is of the form || p^*q, 6 || where p<-»<z G £p and 9 £ Sub. 
The meaning of || p^-»q,9 || is that the control of execution transfers from q to p with 9 being 
the substitution at p. The set of all possible stack items is therefore 

5 s = {|| p^q,0 || | p^q e£ P A0e Sub} 

The set S of all possible stacks is the set of all possible sequences of stack items from <S" . S 
can be inductively defined as follows. 

• $ £ S; and 

• || p^,q, 6 || • S £ S if || p++q, 9 || G 5" A S G S. 

Let xi G <S", . . . , x n £ <S" and S £ S. X\ ■ . . . ■ x n ■ S is sometimes written as 

■111 

•I'll 
S 

The set <So C S of initial states is determined by the set of queries in VSLDNF. 

So = {|| p++q, 9\\-%\\\ p^q, 9 || S <S» Ap^q e £% A 6 e e p[ i]} 
The set of final states is 

Soo = {|| exit(k)<-,q,9 || • $ | k £ K G A 9 G Sub A exit(k)^,q £ £ P } 

The set of descendant states of the set So of initial states is obtained by applying the 
transition rules in figure |]. Rule (0) says that every final state is stable. Rule (la) corresponds 
to direct procedure-entry and rule (lb) to indirect procedure-entry. Rule (2) corresponds to 
procedure-exit and rule (3) deals with negative literals. 

Rule (3) causes the inaccuracy of the transition system. When the transition system reaches 
a state corresponding to a goal with its leftmost literal being negative, the transition system 
may apply either rule (lb) or rule (3). Applying rule (lb), it will go to a state corresponding to 
a goal after performing a procedure-entry indirectly. The application of rule (lb) is to enable 
information to propagate forward so as to ensure that the transition system safely approximates 
VSLDNF. Applying rule (3), it will go to a state corresponding to the goal as if the recursive 
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Rule (0) 


If k G Ng 

then exit(k)<-»q, 9 \\ ■ $ ~> || exit(k)<-»q, 9 \\ ■ $ 


Rule (la) 


/ L q = B q A entry(i)<-*q e £p \ 
J 9 = unify(B q ,a,H i ,e)^fa\\ J 

P 

then q^-uu, a \\ ■ S entry{i)^-uq, 9 || ■ q<-»u, er • S* 




Rule (lb) 


/ L q = ^B q A entry(i)^uq £ £p \ 

■' \A 9 = unify{B q ,cj,H i ,e)^fa\\ ) 
p 

then q^-*u, a \\ ■ S || entry(i)^-uq, 9 \\ ■ $ 


Rule (2) 


V A = unify(H q[1] ,a,B p -,r])^h\\ J 
then q<-+u, a • p-^-*v, n ■ S ~> # • <5 


Rule (3) 


// p<-»g e £p 

then || ■ £ ~> || £>^-»g, • 5 



Figure 1: Transition rules 



invocation of VSLDNF with the negative literal had failed while the recursive invocation may 
succeed in some cases. This results in a simple approximation of VSLDNF. 

The set of descendant states of a set So of initial states is therefore the least fixed-point of 
function Fp that is defined as follows. 



F P (X) 
F P (X) 
F P (X) 



F 2 P (X) 



Fl{X) 



def 



def 



def 



u 



def 



def 



U p3 p{x) 

0<J<3 

p^»q,9 || • $ | p<-»q e£ P A9e e p[1] } 




p^-»q £ £p A L q = B q 
A || q^,u,a || • 5 S X 

A 9 = unify(B q ,a,H p[1] ,e) ^ fail 



p<^-»<7 e £p AL q 



-B„ 



q^-»u, a \\ ■ S & X 

= unify(B q , a, H p[1] ,e) ^ fail 

2 



p<-»g e £p 

II q^»u,a || 

|| ?7 1| ex 



S 

A 9 = unify(H q[1] ,a,B p -,n) ^ fail , 
p<^.g, || -S | € £f> A || || • S G X} 



(10) 

(11) 
(12) 



(13) 



(14) 
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The domain V of Fp is p(S). < T>,C> is a complete lattice and Fp is monotonic on 
< V,C>. 

Rule (0) of the transition system is not embodied in Fp. Since Fp is a monotonic function 
on < T>, C>, we have Fp | k C Fp | (k + 1) for any fc > 0. Therefore, any final state will be 
in IfpFp if it is derivable from an initial state by the transition system. 



4.2 Collecting semantics 



IfpFp is a set of states. A state is a stack that corresponds to a goal. The collecting semantics 
IfpFp first abstracts away the sequential relation between stack items of a stack and then 
classifies the stack items according to edges p<-»q. lfpF p associates each p<-»q in £p with the 
set of substitutions 9 such that || p^-uq,8 || is a stack item in Each edge p^-»q G £p will 
be assocaited with a member from p(Sub). < p(Sub), C, 0, Sub, n, U > is a complete lattics. 
Therefore, the domain £>" of the collecting semantics is the Cartesian product of the same 
component domain p(Sub) for as many times as the number of edges p^-*q in £ P . Let X* e VK 
We use Xp^q to denote the component in X" that corresponds to edge p<-+q. Let X$, Y" G 
Define 



— tj< — ma 



dg 
dg 
dg 
dg 
def 



yp^»q G fp.(X*_ 9 C Yp^) 
n v" 

Sub 



< £> J , _L J , T J , n", U J > is a complete lattice. 

The approximation of a set of stacks by a vector of sets of substitutions is modeled by the 
following monotonic function 7" G 2?" <— > 2?. 



7 »(X«) = < 



Pn-»gi,0i 



Vl<i< n.{pi^ qi e£ P A6 t £ Xl^ qi ) 



(15) 



Let A, £? be atoms, and O, Q, be sets of substitutions. Define 

unify* (A, 9, B, ft) d = 0, B, w) ^ fail | 6 G 6 A w G ft} 

The fixed-point collecting semantics is defined in the following. 



(16) 



[4(^)1^ = 

e p[ i] if P^q e £° P (17) 

\J{unifyt(B q ,Xl^ u ,H pm ,{e}) | q^u G £ P } if p^,q G £ P (18) 
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p-+-*v e Ep 

A q+-9U £ £p 



if p^,q e Ep (19) 



\J{X^ U I q++u e £p} i/ P< -<z e 4 (20) 

F P is a monotonic function on < 2?", c">. 

Example 4.1 Let P be the program in example |2.2[ Fj, is a system of 23 simultaneous recur- 
rence equations. Each equation corresponds to an edge in £p. The following four equations are 
examples of equations p"7f|2(i| respectively. Let A — member(X, L) and B — member(X, [X\L]). 



unifyt(A,xl 1A) ^ iy B,{e}) 

[4(^)](5,1)^(0,0)=©5 
[-Pp(^' l) )]( li 2)<-»(3,1) = 

u Wy J (s,xf3 1) ^ (li2) ,A^f l!l) ^. (5 ,i)) 

U unify*(B, Xl^^, A, xl^^) 
U unify*(B, xL)<_. (2i2) , 4 * ( Yi)^(5,i)) 
U um/y«(S, *( 3)1)< _. {4il) ,4 ^,1)^.(5,1)) 

[^p(^')](l,3)^»(l,2) = ^"(1,2)<-»(3,1) U ^(l,2)<-»(4,2) 



□ 



Lemma 4.2 ZfrFp C ^{lfpF p ) 



Proof: See p. 33 



5 The Generic Abstract Semantics F P 

The collecting semantics lfpF p is a safe approximation of the operational semantics and can 
be used as a basis for program analysis because any safe approximation of this collecting 
semantics is a safe approximation of the operational semantics. WpF P ] p ^_ 9q contains all the 
substitutions whenever the control of execution transfers from program point q to program 
point p. [IfpFp] ^ is usually an infinite set of substitutions and is therefore not computable 
in finite time. In order to obtain useful information about the possible substitutions when the 
control of execution transfers from program point q to program point p, further approximations 
are needed. This section derives the generic abstract semantics F P from F p . 
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5.1 Abstract domains 



The collecting semantics IfpFp associates with each edge p<-*q a set of substitutions which is 
a superset of the set of the substitutions whenever the control of execution transfers from q 
to p. When program is analysed by means of abstract interpretation, the set of substitutions 
associated with p<^-»q is approximated by an abstract substitution associated with p<r-»q. For 
edge p^-»q, only values of the variables in V p h] are of interest and, for edge p'^-»q', only values 
of the variables in V p /[i] are of interest. The abstract substitions for p^*q and p' <r-»q' are from 
different domains when p'[l] ^ We will simply call a domain for abstract substitutions 

an abstract domain. We find it convenient to parameterise abstract domains with finite sets 
of variables instead of having a single abstract domain for all abstract substitutions associated 
with different edges or constructing abstract domains for different edges in different ways. Let 
ASubv denote the domain for abstract substitutions for describing values of variables in V. 
Then [lfpF p ] is represented by a member of ASubv p[1] . We require that, for any finite 
V C MAR,, 

CI: < ASuby, \— v , Ty, Ty, riy,LJy > is a complete lattice where C v is a partial order on 
ASubv, _Ly the infimum, Ty the supremum, Hy the greatest lower bound operator and 
LJy the least upper bound operator; and 

C2: there is a monotonic function 7 V £ ASubv i— > p(Sub). 

The domain V b of F P is constructed in the same manner as the domain 2?" of F p was 
constructed. Each member X b in V b is a vector that is indexed by edges p<r-»q in Ep. X b ^_ 9q is 
an element from ASub v m . Let X b £ V b and Y b £ V b . Define 



x b c b 


Y b 


def 


Vp^g e £p 


■i X p<-»q Ev„[i 


[x b n b 


Y b ] 


def 


X p^»q n V p | 


Y b 

1] *-p^-9q 


[x b u b 


Y b ] 


def 


X p^mq U V P | 


Y b 

1] L p^-9q 


T b 

p<r-»q 




def 


T V P [i] 




± b 




def 


L v p[1] 





< V b , C b , ± b , T b , n b , U b > is a complete lattice. 

The concretisation function 7 b is defined in terms of 7 V . . For every 6 £p and X" E T> b , 

h b ( X %^.q=^V p[1] ( Xb P ^ q ) (21) 

The monotonicity of 7 b £ 2? b i— > £>^ follows immediately from equation [2l| and C2. 
5.2 The Generic Abstract Semantics Fp 

F P is derived from F p as follows. A set £ p(Sub) of substitutions is replaced by an abstract 
substitution 9 b in ^Su&y where V is a set of variables whose values are of interest, unify^ 
applied to two sets of substitutions described by 9 b £ ASuby and a b £ ASubv respectively 



1G 



is replaced by an operator unify uv applied to 9 b and <r b . U in the definition of [F P ] ^ is 
replaced by Dy Let 9 b k G ASubv k be the least abstract substitution such that Ofc C j Vk (0 b k ) 
for each k G Kg. Note that #t instead of Ok is given before the program is analysed. Let 
G ASubvt, called an abstract identity substitution in ||, be the least abstract substitution 
such that e G 7v (eVi) f° r eacn « G K c . ^p is defined as follows. 



def 



if p^-mq G £ P 

U Vp[1] {unify Vq[i] y p[i] (B qi Xl^ ul H p[1]l I Vpll] )\q<^u G £ P } 

if P^q G £p 

LJ Vp[1] |tmi/i^ (t[l]jVjiI1I (fl, [1] ,^ < _ -u) Bp_ ) X^_. u ) A P q< ^J^£ p P 

if p^*q e £p 



(22) 
(23) 

(24) 

(25) 



Example 5.1 Let P the program in example 2.2, Then F P is a system of 23 simultaneous 
recurrence equations. The following four equations correspond to the four equations in exam- 
ple 4.1 respectively. Let A = member{X,L) and B = member{X, [X\L]). 



□ 



unify {x L K} {x L} (A, x \ 1A) ^ 5A y B ,e Va ) 

[Fp(X )](i ) 2)^-#(3,l) = 



um fy {X,L},{X,L,K}( B ^ X l 3 ,l)<^(l,l)^^ X (l,l)^9(5,l)) 
U{X,L,K} uni fV {X,L},{X,L,K}( B , X l 3 ,l)^(l,2)^ A ^ X (l,l)^(5A)) 
U{X,L,K} «/y {X,q,{^,g}( B »^(3,l)^(2,l)i^i4,l)^(5|l)) 
LJ{X,L,K} "™/2/ {X,L},{X,L,A-}(- g ! ^'(3,l)<-,(2,2)^' X (l,l)^-»(5,l)) 
U{X,L,if} u ™/y{X : L} : {Jf,L,K}( B i^(3,l)^«(4,l)'^'^ : (14)^»(54)) 
' 3 (-^"'')](l,3)<-«(l,2) = ^(1,2)<-»(3,1) U {X,L,K} -^(l,2)<-»(4,2) 



Theorem 5.2 lfpF P l\lfpF P ) if 
C3: e G 7v(ev)j and 
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C4: unif yt(A,j u (6 b ), B,^ v ( a b )) C 7 V • unify uy (A, 6 b ,B, a b ) for any finite W,VC Wfc, any 
8 b e ASubu, any er b e ^ISufey, and any atoms A and J5 such that vars(A) C W and 
vars(B) C V. 



Proof: See p. 34 



5.3 Complexity of F)> 

The cost of computing lfpF p is affected by the characteristics of P, the abstract domain, 
abstract substitutions where fc G Kg, and the algorithm for least hxed-point computation. 



Using O'Keefe's algorithm for least hxed-point computation 36 , the worst case cost of comput- 
ing IfpFp is proportional to the product of the number of operations in Fp and the maximum 
height dmax of ASub\> i for i £ K. Since LJ is much less costly than unify in most cases, we 
measure the worst case number of occurrences of unify in Fp. 

Let S be a set and dehne #S be the number of members in S. Let 

Af p = {peM P \ 3q.p^,q e £ P } 

for < j < 3. £ p is the set of edges whose ending points lie in Afp. Let pmax be the maximum 
number of predecessors that a program point has. By equations ^2] and ^5|, unify does not 
occur in equations for edges in £ P U £p. By equation |25|, unify occurs at most pmax times in 
the equation for an edge in £p. So, unify occurs at most #£p * pmax times in the equations 
for edges in £ p. By equation ^4|, unify occurs at most pmax 2 times in the equation for an edge 
in £ P . So, unify occurs at most #£p * pmax 2 times in the equations for edges in £p. Since 
#£f> < #£p and #£p < #Kc * pmax, the worst case number of occurrences of unify in F P is 
0(#Kc *pmax 3 ). Therefore, the worst case cost of computing IfpFp is 

0{dmax * #Kg * pmax 3 ) 



6 The Generic Abstract Semantics Fp 

A further approximation may be made of F P so as to reduce the complexity of program analyses. 
IfpFp is a vector indexed by edges in the program graph for P. IfpFp associates with each 
program point with several abstract substitutions, each for one edge ending at the program 
point. This results in fine analyses for applications such as program debugging. However, there 
are some applications where such fine analyses are not beneficial with respect to their costs and 
one abstract substitution for each program point is a better choice. Fp fulfills such purposes 
and is derived from Fp by one more approximation. 

The domain of Fp is T>° and each X" 6 T>° is a vector indexed by program points. X>° is 
constructed from ASuby in a similar manner as is V b . Let X } Y € P°, X° 6 ASubv p[1] and 
define 

X . Q . y« dg yp £ N {x . ^ ^ (2g) 

[x°n«YX d = f x;u Vv[1] y; (27) 
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[X° U° Y c 



T° - 
v 

■ o <tef 

p — 



— LJ Vp[1] F p ° 

def = 
= I V. 



(28) 
(29) 
(30) 



The approximation through collapsing the abstract substitutions associated with all the edges 
ending at a common program point is characterised by the following concretization function. 



(31) 



It follows immediately from equation [5l] that 7 G V ^ V b is monotonic. We now construct a 
monotonic function Fp on < V° , C°> such that F P ■ 7°(A°) C b 7 • F^X ) for every I 4 G Z>°. 



n Vpll] {unify Vg[i] y p[i] (B q ,X*,H p[1] ,e Vp[1] ) \ p^q G £p} 

?/ P G A/p 

n V p[ i] { wi /2/V a[1] ,V p[1] ( H q[l] > X g > fl i>-> X p-) I P^»<? e f l>} 

if p e A/| 
if p & Np 



X': 



(32) 
(33) 

(34) 

(35) 



Lemma 6.1 lfpFip> ^{IfpFf, 



Proof: See p. 35 



IfpFp associates each program point with an abstract substitution. The simultaneous recur- 
rence equations for Fp are simpler than those for F p and hence the computation of IfpFp is 
less costly than that of IfpFp. Fp is a generalisation of Nilsson's generic abstract semantics 
for definite logic programs |Q . Specifically if P does not have negative literals then IfpFp is 
equal to that in |34| . Nilsson later j3l| presented a generic abstract semantics that is based on 
a collecting semantics that associates with each program point a set of pairs of goal structures. 
A goal structure is very similar to a stack in our term. 

Each program point in P corresponds to an equation of Fp . By equations [32] and unify 
does not occur in equations for points in Np UA/p. unify occurs at most pmax times in an 



equation for a point in Np U Mf according to equations 33 and p4. The worst case number of 



occurrences of unify in Fp is 0(#A/p * pmax) since #A> + #A/| < #A>. Therefore, the 
worst case cost of computing IfpFp" is 

0{dmax * #Ap * pmax) 

Fp ( a generalisation of the generic abstract semantic in J34| ) may also be used to obtain 
abstract substitutions for edges. We prefer Fp to Fp for such analyses since Fp is easier to 
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specialise than Fp. In order to specialise Fp (or the generic abstract semantics in J35[) for such 
an analysis, in addition to the work required to specialise Fp for the same analysis, one needs 
to do 

• keeping information about program points in abstract substitutions, an abstract substi- 
tution for Fp is a set of pairs of a program point and an abstract substitution for Fp; 

• replacing unify by unify which, for each member of 9° x cr°, discards point information 
and calls unify. 

This amounts to requiring the analysis design who specialises Fp to undo approximation 7 . 
A call to unify may cause unify to be called as many times as pmax 2 . So, the worst case 
complexity of Fp is no less than that of Fp for the same analysis. 

We have so far developed the generic abstract semantics IfpFp and IfpFp for forward abstract 
interpretation of normal logic programs. IfpFp obtains an abstract substitution for each edge 
in £p while IfpFp obtains an abstract substitution for each program point in A/p. In order 
to specialise either of these generic abstract semantics to perform a particular analysis, it is 
sufficient to design ASub, 7, e,unify and LJ such that they satisfy C1-C4. 

7 Example 

We now illustrate how F p and Fp can be specialised to perform a particular analysis through 
groundness analysis - a simplified version of mode analysis. 

In groundness analysis, we are interested in knowing which variables will be definitely in- 
stantatiated to ground terms. Therefore, a set of substitutions is approximated naturally by a 
set of variables. ASubv — p(V). The partial order on ASubv induced by C on p(Sub) is D. 
< p(V), 2, V, 0, U, n > is a complete lattice. 

The approximation of a set of substitutions by a set of variables is modeled by the following 
concretisation function j v G p(V) l— * p(Sub). 

7 v (6» b ) = f {9 G Sub I VA G 9 b .(X9 is ground)} 

7 V is obviously a monotonic function from < p(V), 3> to < p(Sub), C> for any V C VAJZ. 
For any V, e v = and Dy = n. 

We now present an abstract unification algorithm for groundness analysis. 
Given A G ATO M (E, II, U), 9 b G ASubg , B G ATOM (E, II, V) and cr b G ASub v , the algorithm 
computes unify u V (A, 9 b ,B, a b ) G ASubv in five steps. In step (1), a renaming '5 is applied to 
A and 9 b to obtain A$> and 9 b ^> so that vars(A^)C\vars(B) = and vars(9 b ^)nvars(a b ) = 0, 
and b i$> and a b are combined to obtain = 9 bl ^/L)a b so that a substitution satisfying £ b satisfies 
both 9 bi & and a b . Note that C b € ASw&i^ijv- In ste P ( 2 )' ^0 = mgu{A^,B) is computed. If 
i?o = fail then the algorithm returns _Ly that is V. Otherwise, the algorithm continues. In step 
(3), if = downwards(Eo , C i b ) is computed so that rf is satisfied by any £ o mgu(Eo^) for any £ 
satisfying £ b . In step (4), the algorithm computes (3 b = upwards(r] b , Eq) from rf such that any 
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substitution satisfies if if it satisfies /3 b and unifies Eq. In step (5), the algorithm restricts 0° 
to V and returns the result. 



Algorithm 7.1 Let U,V C 1<!4K be finite, 6» b G ASubu, cr b G AS'ufev, wars (A) C W and 
vars(B) C V. 



unify uy (A,e ,B,a) 

let ^ be a renaming such that PI V = 0, 

*/ ^ fail 

i/ien V n upwards(Eo, downwards(Eo , b \f r U cr b )) 
eise V 

downwards(E,e h )= f 6 b U (J uars(i) 

(X=t)6_EAXe6» b 

upwards(E, 6 h ) d = 9 h U {X \ (X = t) e E A vars(t) C (9 b } 

The abstract domain and the concretisation f unct ion satisfy C1-C2 (p.|l6|) and ey satisfies 
C3. The following theorem states that algorithm 7T satisfies C4 (p.|l§l). 

Theorem 7.2 

C4' mi/tot^), B,7 v (a b )) C 7v(»ft M ,v(A 0\B, a b )) for any finite W,VC any 
# b G ASubu, any er b G ASu&y, and any atoms A and B such that wars (A) C U and 
vars(B) C V. 



PROOF: (C4') 6> b * U a b G ASufy^uv- Let C e l u ^uvi e * u °" ) and 
y £ downwards(E ,9 [ '^ U cr b ). Then cither Y~ G 6> b >3> U <r b or there is X and i such that 
X G b * U cr b , (X = t) G -Bo and F G wars(t). So, F(C o to 5 m(£; C)) is ground if 
mgu(Eo^) ^ fail. It is true that if every variable in a term is ground under a substitution then 
that term is ground under the same substitution. Therefore, if 

Z G upwards(Eo,downwards(Eo : 9 bl i> U cr b )) then Z is ground under £ o mgu(E$(). This 
and lemmas 10.1 and [10. 6| complete the proof of C4'. I 



Example 7.3 Let A = #([/, /(V, /(W, W)), V), B = g(f(X,Y), Z, X),9 b = {U} and 
cr b = {Z}. 9 b is an abstract substitution on domain U = {U, V, W} and cr b is an abstract substi- 
tution on domain V = {X, Y, Z}. This example shows the computation of unify u V (A, 6* b , B, a b ) 



by algorithm 7.1 



In step (1), a renaming * = {U/U , V/V , W/W Q } is applied to A and 6> b 

AV = 9(U J(Vo,f(Wo,W )),V ) 
b * = {U } 

and C b = 9 b ^ U a b ) = {U , Z} is computed. 
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In step (2), E = mgu(A*,B) = {U = f(V ,Y),Z = f(V ,f(W , W )),X = V } is com- 
puted. Note that Eq is written as a set of equations in solved form. 
In step (3), if — downwards(Eo , ( b ) — {Uo, Z, Vo, Y, Wo} is computed. 
In step (4), P b = upwards(E ,ri b ) = {[/ , Z, Vq, Y, W , X} is computed. 
In step (5), the algorithm computes and returns restrict^ , V) = {X, Y, Z} 
So,^fh] uy (A,9\B,a b ) = {X,Y,Z}. □ 



Exampl e 7 .4 This example shows the result of the groundness analysis of the program in 
example 2/2. IfpFp has 23 components each of which corresponds to one edge in Zp. The 
following are four of them. 

[^(s.D^d.i) = 
WpFp^^^iYZ} 

□ 



8 Related work and Discussion 

There has been much research into abstract interpretation of logic programs. For a compre- 
hensive survey, see 0j. A number of generic abstract semantics have been brought about for 
abstract interpretation of logic programs [|[ [l8| |2^, |3Cj . Abstract interpretation has been used 
in both forward and backward analyses of logic programs. A forward analysis Q approximates 
the set of substitutions that might occur at each program points given a program and a set of 
goal descriptions. A backward analysis @, ||, |2^] approximates the set of the atoms that are 
logical consequences of a program p2| . However, the problem of forward abstract interpretation 
of normal logic programs has not been formally addressed in the literature although negation 
as failure is dealt with through the built-in predicate ! in the way it is implemented in Prolog. 
We have proposed a simple solution to the problem. We now review previous work and discuss 
about the solution. 

8.1 Approaches to forward abstract interpretation of logic programs 

There are three approaches to forward abstract interpretation of logic programs. A bottom- 
up forward abstract interpreter mimics a bottom-up evaluation strategy. A top-down forward 
abstract interpreter mimics a top-down evaluation strategy. Top-down forward abstract inter- 
preters can be further divided into two sub-classes according to whether or not the underlying 
top-down evaluation strategy uses memoisation. A fixed-point forward abstract interpreter 
computes the least fixed-point of a system of simultaneous recurrence equations. 
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8.1.1 Bottom-up forward abstract interpretation 

The abstract interpreter based on Alexander Templates (AT) |L8| simulates the bottom-up 
evaluation based on AT |38| . Given a program and a goal, AT first transforms the program and 
the goal and then evaluates the transformed program and the transformed goal in a bottom- 
up manner. Given a program and a goal description, the AT-based abstract interpreter first 
transforms the program and the query description in the same way as AT does and then mimics 
the evaluation phase of AT by replacing standard unification with an abstract one. 

8.1.2 Top-down forward abstract interpretation without memoisation 

A top-down forward abstract interpreter without memoisation || [H], Q approximately 
executes a goal description by mimicing the underlying the top-down evaluation strategy. As 
an example, we take the top-down forward abstract interpreter in S. 

Given a goal description that is a pair of an atom and an abstract substitution, the top- 
down forward abstract interpreter in || constructs an abstract AND-OR graph to approximate 
the set of all the intermediate proof trees that may be constructed by SLDNF under the left- 
to-right computation rule for all the goals satisfying the goal description. In other words, any 
intermediate proof tree for any goal satisfying the goal description can be obtained by unraveling 
the abstract AND-OR graph. An AND-nodc is a clause head and its child OR-nodes are the 
atoms in the body of the clause. Every OR-node is adorned with one abstract substitution to 
the left, called abstract call substitution, and with another to the right, called abstract success 
substitution. The abstract success substitution of an OR-node is the abstract call substitution 
of its right sibling. 

The initial abstract AND-OR graph has one OR-node that is the atom in the goal de- 
scription and is adorned to the left with the abstract substitution in the goal description. 
Suppose that the abstract AND-OR graph has been partly constructed. Consider an OR- 
node A with abstract call substitution f3. The abstract interpreter computes the abstract suc- 
cess substitution of OR-node A as follows. For each clause d = Hi <— -E>(i,i), • ■ • , 5(j, m [i]) 
such that Hi may match with AO for some satisfying (3, the abstract interpreter adds 
to OR-node A a child AND-node Hi that has m[i] child OR-nodes • ■ • > -B(i,m[il) an d 

computes the abstract call substitution (3\ n of OR-node Bux\ - the first child OR-node of 
AND-node Hi. (3\ n approximates the set of the most general unifiers of Hi and AO for 
all satisfying (3. The abstract interpreter extends OR-node -B^i) by recursively applying 
the same process and extends OR-node B^j+i} in the same way after it has computed the 
abstract success substitution of OR-node Bujy Eventually, it will have computed the ab- 
stract success substitution j3 l out of OR-node Bu m un. After computing (3 l out for each clause 
Ci = Hi <— • • ■ , B(i !m [i]) such that Hi may match with AO for some satisfying /3, the 

abstract interpreter computes the abstract success substitution (3' of OR-node A and (3 1 ap- 
proximates from above the set of the most general unifiers of AO and Hit] for all the satisfying 
(3 and all the r] satisfying (3 % out . 

Since there are recursive calls, H introduces a fixed-point component to the abstract inter- 
pretation process. Suppose that an OR-node A with abstract call substitution (3 were to be 
extended. If A has an ancestor OR-node A' with abstract call substitution (3' such that A is a 
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variant of A and is a variant of /3', the abstract interpreter adorns OR- node A to the right 
with the infimum abstract substitution and proceeds until the abstract success substitution of 
OR-node A' is computed. The abstract interpreter then repeatedly recomputes the part of the 
AND-OR graph starting from the abstract success substitution of OR-node A to the abstract 
success substitution of OR-node A' by using the abstract success substitution for OR-node A' 
as the abstract success substitution for OR-node A. This fixed-point process finishes when there 
is no more increase in the abstract success substitution of OR-node A' . The same fixed-point 
component is also used to limit the sizes of abstract AND-OR graphs. 

[ |3l[ [ii] , |ir| differ from || only in dealing with recursive calls. and |4(| make use of a 

memo table and [H uses stream predicates. 

8.1.3 Top-down forward abstract interpretation with memoisation 

An abstract interpreter based on an evaluation strategy with memoisation mimics the under- 
lying evaluation strategy with memoisation by replacing concrete substitutions with abstract 
substitutions and the concrete unification with an abstract unification. For an introduction to 
evaluation strategies with memoisation, see [ [f5) . 

The abstract interpreter based on OLDT resolution jl9|, [2(], mimics the OLDT resolu- 
tion [|3£j. The left-to-right computation rule is used in OLDT resolution. Given a goal that is 
a pair of a sequence of atoms and a substitution, OLDT resolution |3{| constructs an OLDT 
structure for the goal. An OLDT structure consists of a search tree, a solution table and an 
association. An entry of the solution table has a key and a solution list. The key is an atom 
and the solution list is a list of atoms that are instances of the key. Each node of the search 
tree is a pair of a goal and a substitution and each edge of the search tree is labeled with a 
substitution. The association is a group of pointers between the nodes of the search tree and 
the entries of the solution table. 

Initially, the search tree has one node that is the pair of the sequence of atoms and the 
substitution, and both the solution table and the association are empty. OLDT resolution 
extends the OLDT structure as follows until it cannot be further extended. Suppose that the 
OLDT structure has been partly constructed. Consider a node < (A,R),a > in the search 
tree where A is an atom, R a sequence of atoms and a a substitution. If there is an entry 
in the solution table with a key that is a variant of Aa then < (A, R),a > is called a lookup 
node. Otherwise, it is called a solution node. OLDT resolution extends the OLDT structure by 
extending its lookup nodes and its leaf solution nodes. If < (^4, R), a > is a leaf solution node 
then OLDT resolution first adds into the solution table an entry whose key is Aa and whose 
solution list is an empty list that will be filled in later. OLDT resolution then, for each clause 
Ci = Hi <— , ° ° ° , Bu m i{n such that Hi and Aa unify with 9 being the most general unifier, 
adds < • • • , B(j, m rii), R), a o 9 > as a child node to node < (A, R), a > and labels the 

edge from node < (A, R), a > to node < (B^ ^, • • • , B^ t7n ^, R), a o 6 > with 9. These child 
nodes will then be extended by the same process. If < (A,R),a > is a lookup node then, 
OLDT resolution first adds < R,a o 9 > as a child node to < (A, R),a > and labels the edge 
from < (A,R),a > to < R, a o 9 > with 9 for each solution Aa9 in the solution list for key 
Aa, and then adds to the association a pointer from lookup node < (A,R),a > to the tail 
of the solution list for key Aa. This pointer will be used to add more child nodes to lookup 
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node < (A,R),a > because at the moment lookup node < (A,R),a > is first extended, some 
solutions for Aa might be unavailable from the solution list and will show up later. When a 
unit clause is resolved with a leaf solution node < (A, R),a > , the unit clause completes a 
sub-refutation for Aa and may also completes sub-refutations for the leftmost atoms of other 
nodes along the path from < (A,R),a > up to the root of the search tree. Whenever a unit 
clause is resolved with a leaf solution node, OLDT resolution updates the solution lists for those 
keys that corresponds to completed sub-resolutions. After the solution list for a key is updated, 
OLDT resolution expands those lookup nodes that have pointers pointing to the solution list 
accordingly. 

The abstract interpreter mimics OLDT resolution closely by constructing an abstract OLDT 
structure for a goal description that is a pair of a sequence of atoms and an abstract substitution. 
The nodes of the abstract OLDT structure are pairs of a sequence of atoms and an abstract 
substitution instead of a concrete substitution and the edges of the abstract OLDT structure are 
now labeled with abstract substitutions instead of concrete substitutions. The key of a solution 
table entry is now a pair of an atom and an abstract substitution and so is each solution in 
the solution list for the key. The abstract interpreter mimics OLDT by replacing the concrete 
unification function with an abstract unification function and the concrete composition function 
for concrete substitutions with an abstract composition function for abstract substitutions. 

8.1.4 Fixed-point forward abstract interpretation 

Given a program and a set of goal descriptions that are abstract atoms, f3C| derives a system 
of concrete simultaneous recurrence equations whose least solution approximates the set of all 
the input atoms and the set of all the output atoms that occur in an intermediate proof tree 
derivable from the program and any goal satisfying one of the goal descriptions. The system of 
concrete simultaneous recurrence equations is approximated from above by a system of abstract 
simultaneous recurrence equations with each concrete operation being replaced by an abstract 
operation. Abstract interpretation is done by computing the least fixed-point of the system of 
abstract simultaneous equations. 

Given a program and a set of goal descriptions each of which is a pair of a goal and an abstract 
substitution, (34j derives a system of concrete simultaneous recurrence equations whose least 
solution gives each program point a superset of the set of all the possible substitutions at the 
program point during the satisfaction of any goal satisfying one of these goal descriptions. A 
system of abstract simultaneous recurrence equations is derived to approximate from above the 
system of concrete simultaneous recurrence equations in the same manner as in pc| . Abstract 
interpretation is accomplished by computing in an abstract domain the least fixed-point of the 
system of abstract simultaneous recurrence equations. 

collects a set of substitutions for every program point. Q collects the set of input 
atoms and the set of output atoms. Collecting the set of input atoms corresponds to collecting 
a set of substitutions for the entry point of each clause, applying each substitution in the set 
to the head of the clause to obtain a set of atoms for the clause and then lumping together the 
sets of atoms for all the clauses as well as the given set of input atoms. Similarly, collecting the 
set of output atoms corresponds to collecting a set of substitutions for the exit point of each 
clause, applying each substitution in the set to the head of the clause to get a set of output 
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atoms for the clause and lumping together the sets of output atoms for all the clauses. 

[ |30| uses the idea of a trace to summarise the execution of a query. When making abstraction, 
the sets of call substitutions of different calls to the same predicate are lumped together in a 
single set input. Similarly, the set of success substitutions are lumped together in the set 
output. 

In jL7|, contexts are recorded only at the entry of each program clause. |l7j is also a 
generic procedure, their core semantics is augmented with application dependent auxiliary 
functions that are similar to abstract operations in 0. These auxiliary functions operate on 
abstract domains consisting of appropriate approximations of the collecting semantics. They 
distinguish between different call instances. However, there is only one instance of every clause, 
so substitutions originating from different call instances are lumped together. 

8.2 The Negation as Failure 

The treatment of negation as failure in Fp and Fp is simple. The transition system Fp 
approximates VSLDNF (an equivalent of SLDNF) by assuming that a negative literal always 
succeeds while it may fail. This approximation introduces noises into Fp and Fp. However, it 
is difficult within the provisions of abstract interpretation to improve on this simple solution. 

Let {~^A)a be selected by SLDNF where -*A is a negative literal in the body of a clause in the 
program and a be a substitution. During abstract interpretation, a is not known and possible 
values for a are described by an abstract substitution cr g ASuby , often called the abstract call 
substitution for (^^4). Since <r° usually describes an infinite set of substitutions, it may well be 
the case that Aa succeeds for some a 6 7y(c b ) and fails for other a S 7y(c 1 '). We take cr as the 
abstract success substitution for ->A by simply assuming that Aa fails for all a £ 7y(°' b ) (~>Aa 
succeeds). An improvement needs making the abstract success substitution for -^A stronger, 
that is, replacing a b with another abstract substitution if £ ASub\> such that if Cy a 9 . Let us 
assume that 7v( (T '')\7v( r ?) ^ $ f° r otherwise if is no stronger than a . By safeness requirement 
for negation as failure and safeness requirement for abstract interpretation, it is necessary to 
be able to infer 

yO G 7v(c )\7v( 7 7)- ( e is a computed answer for P U {<— AO} ) (36) 

To infer ^56|, we need to under- estimate success and over-estimate failure in order to make the 
analysis safe. However, abstract interpreters over-estimate success and under- estimate failure. 
Note that the word approximation in abstract interpretation means approximation from above. 
An abstract interpreter over-estimates success by means of an abstract unification function 
which approximates the normal unification function from above, that is, over-estimates the 
success of the normal unification function. To infer |36j, we must use a unification function 
which approximates the normal unification function from below. Such a unification function 
should succeed only if the normal unification function succeeds. Of course, we could use a 
unification function which always fails. But, this does not achieve any improvement since such 
a unification function will make Ao always fail. Since an abstract domain for an analysis is 
much simpler than the concrete domain, some information about a set of substitutions is lost 
when the set of substitutions is approximated by an abstract substitution. It is difficult to 



26 



design a unification function which approximates the normal unification function from below 
based on abstract substitutions because abstract substitutions are inaccurate descriptions of 
sets of substitutions while such a unification function, we believe, needs accurate descriptions 
of sets of substitutions. 

9 Summary 

We have presented and justified a simple solution to the problem of forward abstract inter- 
pretation of normal logic programs and derived generic abstract semantics IfpFp and IfpFp 
of normal logic programs. The solution is simple and it amounts to replacing the negation as 
failure rule with an unconditional derivation rule. 

IfpFp and IfpFp can be specialised for various analyses. An analysis can be thought of as a 
series of approximations of the operational semantics as shown below. An arrow from A to B 
reads as "A is approximated by B" . 




abstract domain and corresponding concretisation function, to provide a function for computing 
abstract identity substitutions, a function for computing the least upper bounds, and a function 
for computing abstract unifications. The abstract domain and the concretisation function must 
satisfy C1-C2, the function for computing abstract identity substitutions must satisfy C3, and 
the function for computing abstract unifications must satisfy C4. 



We deal with negation as failure by approximating SLDNF with a transition system. The way 
negation as failure is dealt with may be generalised to deal with built-in predicate !. Although, 
we have not dealt with other built-in predicates, we believe that the generic abstract semantics 
can be augmented to deal with these built-in predicates in the way they are dealt with in || . 

10 Appendix 

Lemma 10.1 Let p be a renaming such that vars(ap) r\vars(b) — and vars(<f>p) r\vars(ip) — 
0. If (ap)(<fip) and btp unify then ap and b unify. 

Proof: Let ap = a' and <f>p = <fi' . If a'4>' and bip unify then there is a substitution 8 
such that a'(f>'9 = b(f>6. We have vars(a') (~1 dom(ip) = and rang(iji) n dom(<j}') = and 
vars(b) n dom{4>') = 0. Hence, a'tfxp'O = bip<p'9. Therefore, ap and b unify. I 
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Lemma 10.2 Let A and B be two atoms, and pi and p 2 be two renamings such that 

dom(pi) — dom(p 2 ) 2 vars(B) (37) 
rang{p 1 ) f~l vars{A) = (38) 
rang(p 2 ) l~l wars(A) = (39) 

Then 

(a) A and Bp\ unify iff A and Bp 2 unify. 

(b) mgu(A, Bpi) f wars(A) = mgu(A,Bp2) f vars(yl). 

(c) pi o mgu(A, Bpi) f dom{p\) = p 2 ° mgu(A, Bp 2 ) t dom(p 2 ). 

PROOF: Let uars(A) = {Xl, • • • , Jf^}, dom(px) = dom(p 2 ) = {Vi, - ■ ■ ,Vi}, 
Pi = {Vi/n, • • • , VJ/YJ} and ^ = {Fi/Zi, • • ■ , Vi/Zi}. Define 

P 4 ^{yi/Z!,---,^/^} 

^ = J {ii,---,y} 



We have 
and 



V =^ {Vi, • • • , Vj} 



Pi = p2 o p 3 T V (40) 



P2 = Pi ° p4 T V (41) 
Suppose that A and _Bpi unify with $i being their most general unifier. Let 

9i = {X n / Xil ,---,X is /x is , Y n /y n , ■ ■ ■ , Y jt /y jt } (42) 

with 1 < ii < ■ ■ ■ < i s < k and 1 < ji < ■ ■ ■ < jt < I. Define 

4g( Y h if h / ■,*} (43) 

\y h If he {31,32, ■■■,3t} V ' 

By equations ^2 - 43 , we have 

Pl o6i T V = {V 1 /y 1 ,---,V l /y l } (44) 

Ap 3 6i = A0i = BpiOi = B{p 2 o p 3 t V)6i = Bp 2 p 3 9i by equations [T| ||, f«| || and ||. So, 
A and Bp 2 unify with p 3 9i being one of their unifiers if A and Bpi unify with 9i being their 
most general unifier. 
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Suppose A and Bp 2 unify with 9 2 being their most general unifier. Let 
^2 /"Bin 1 ' ' ' 1 X Up /%up ? Z Vl / z Vl , ■ • ■ , Z Vq / z Vq } 

with 1 < u\ < ■ ■ ■ < Up < k and 1 < v± < • • • < v q < I. Define 



Zh 



def J Z h Ifhg{v 1 ,V 2 ,---,V q } 

Zh If h G {vi, v 2 , ■■■ ,v q } 



By equations ^5 - 46 , we have 



P2°02 T V = {V 1 /z 1 ,---,Vi/zi} 



(45) 



(46) 



(47) 



^402 = A6 2 = Bp 2 6 2 = B{px o p 4 | V)0 2 = Bp 1 p i 9 2 by equations |37]-|38|, |4lJ and|45j^§ So, 
A and Bpi unify with ^4^2 being one of their unifiers if A and Bp 2 unify with 8 2 being their 
most general unifier. Therefore, (a) holds. 

The following equation results from equation Wa and the definition of p^. 



( {X n /x h ,- ■ ■ ,X is /x is } \ 

U 

i Y io/yja \ l<o<tAY jo gZ} 

u 

{Zi/yi, ■■■ ,Z t /yi} 



(48) 



The following equation results from equation |45j and the definition of p^. 



P4&2 = 



' {^"ui /-Kill 1 ' ' ' 1 X Up I ' X Up } 1 

u 

{Z v Jz Vo I 1 < < qAZ Vo 
U 

{Y 1 /z 1 ,---,Y l /z l } 



(49) 



Since P462 is a unifier of A and Bpi, there is a substitution 5\ such that p±6 2 = ^1^1- By 
equations [42 and [49, we have 



' {X U1 / x ui , • ■ ■ : x. Up / x Up \ ^ 

U 

{Z v Jz Vo I l<o<qAZ Vo gy} 
U 

\ {n/zx,...,^/^} J 

u I A, 

{ y ii/yji>---> y j t /^ t } 



(50) 
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Since p3#i is a unifier of A and Bp2, there is a substitution (52 such that p^,9i = 6262- By 
equations and we have 

/ {X i Jx il ,---,X i Jx i3 } \ 
U 

{Y jo /y jo \ 1 <o<t AY jo & Z} 
u 

{Zi/yu--->Zi/yi} 



\ 



(51) 



U I (5 2 

By equation ^0|, {X^ , • • • , Xi a } C {-Xtt! , • ■ ■ , X Up } and, by equation 51 
{X Ul ,---,X Up } C {.V, .•••..V, }. So, (.Y...---..Y, 1 - {X Ul ,---,X Up }. We have s = p 
and !„ = m for 1 < o < s. We also have, from equations |5C|-|5"lj 



.7, / 



XiJ 2 

x lo 5\ 



Xi o = Xi o for 1 < o < s from equation |5^. Therefore, (b) holds. 
By equation |5^, we have 

Y h /z h eSi If h<£{ji,---,jt} 
Zh=Vh5\ If h 6 {ji,- ■ • ,j t } 



(52) 



(53) 
(54) 



y h =Y h iih^ {ji, ■ ■ ■ , j t } by equation |43j. So, z h = y h 5i for h £ {jt, ■ ■ ■ ,j t } by equation |53 
This and equation ^4] imply that, for all 1 < h < I, 

Zh = Vh^i (55) 

By equation we have 

Z h /y h £5 2 If h<?{v lr --,v q } (56) 

Vh = z h S 2 If he {vi, ■ ■ ■ , v q } (57) 

Zh = Zh if h $ • • • , v q } by equation [l(| So, yh — Zh&2 for h $ {vi, • • • , v q } by equation |56[ 
This and equation ^7] imply that for all 1 < h < I 

Vh = z h &2 (58) 
By equations 0, 0, || and ||, p x o 6»i | V = p 2 ° 6» 2 T V. Therefore, (c) holds. I 

Corollary 10.3 Let A and B be two atoms and p be a renaming such that dom(p) D vars(B). 
If vars(A) n vars(B) = and vars(A) n vars(Bp) = then ^4 and i? unify iff ^4 and -Bp unify, 
and 

mgu(A, B) f vars(B) = (po mgu(A, Bp)) | vars(B) 



Proof: The proof results immediately from lemma 10.1. (a) and (c) by letting p 2 = p and 



P! be the renaming on vars(B) such that Xp! = X for each X € vars(B). 



30 



Corollary 10.4 Let A and B be two atoms, pa and ps be renamings. If 

dom(pA) 2 vars(A) 

dom(pB) 2 vars(B) 
vars(Apji) n vars(B) = 
vars(Bps) H wars(A) = 
then ApA and i? unify iff A and Bps unify, and 

(pA o mgu(ApA, B)) f dom(pA) — mgu(A, Bps) T ua^s(A) 



Proof: We prove the j/ part. The ojiZj/ i/ part is a dual case of the if part. Let p' B 
be a renaming such that dom(p' B ) = dom(pB), vars(Bp' B ) D vars(A) = and vars(ApA) H 
vars(Bp' B ) = 0. 

Suppose that A and Bps unify. By lemma 10.1. (a), A and -Bp^ unify, and mgu(A,Bp' B ) f 
vars(A) = mgu(A, Bp B ) f vars(A) by lemma 10.1 .(b). By corollary 10.5 , ApA and -Bp^ unify, 
and 

Pa ° ?ngu(ApA, Bp' B ) f vars(A) = mgu(A, Bp' B ) | vars(A) 

So, pa o m<7tt( J 4p J 4, -Bp^) T wars(A) = mgu(A, Bp B ) f uars(A). By corollary 10.3) , ApA and B 
unify, and 

mgu(Ap A , B) | vars(ApA) S mgu(Ap A , Bp' B ) | rars(4/)A) 

hence pa o mgu(ApA, B) \ vars(A) = pa ° m 9 u (ApA, Bp' B ) f uars(A). Therefore, pa o 
mgu{ApA, B) f wars(A) = mgu(A, Bp B ) f wars(A). It now suffices to prove pA°Tngu{ApA, B) j 
dom(pA) — Pa ° ~mgu(ApA, B) | varsf^). Let pjj = pa T wars(A) and Pa = Pa T (dom(pA) — 
vars(A)). We have pa = p A U p^, 

PA o mgu(ApA, B) | dom(p A ) 

= (Pa u Pa) ° ™3"(^(Pa u Pi)) B ) T dom{p A ) 
= Pa° m S"(A°A> B ) T wa^s(A) U p A 

and 

PA ° fngu(ApA, B) f wars(A) 

- (p A U p A ) o mgu{A{p 1 A U p A ), B) | vars(A) 

— Pa° m 9 u (ApA, B) | vars(A) 

We also have rang(p\ o mgu(Ap\ 7 B) f wars(A)) n dom{p A ) = and 
dom(p 2 A ) n uars(A) = 0. So, 

(pA ° mgu(Ap A , B) | wars(A)) o p A 

= (p A o mgu(Ap\, B) f wars(A)) o p^ 
= p A o mgu(Ap\, B) f varsf^) U p A 
= pa o mgu(Ap A , B) \ dom(p A ) 
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Therefore, pA o mgu(ApA, B) f dom(pA) = pa omgu(ApA, B) f vars(A) since is a renaming. 

I 

Lemma 10.5 Lei 6± and 02 be two substitutions and Va set of variables. 

Oi o 2 t V - (0i T V) o 2 t V 

Proof: Let (X/t) € 0i o 2 T V. Then leV. Either X G dom{6x) or X <£ dom(6i) Me 
dom(9 2 ). If X e dom(8 1 ) then there is ii such that ((X/ti) G #i A i = ti0 2 )' Since leV, 
(X/ii) G 0i T V and hence X/(ti<9 2 ) = (X/t) G (0i | V) o 2 f V. Otherwise, X £ dom(0 2 ), 
(X/i) G 2 and (X/t) G (0i T V) o 2 T V. 

Let (X/t) G (0i T V) o 2 t V. Then X eV. Either X G dom(0i | V) or X £ X | V A X G 
dom(6 2 ). IfX G dom(0i | V) then there is t 2 such that ((X/t 2 ) G 0i t VAt = t 2 6 2 ). (X/t 2 ) G 9 1 
and (X/t) G 0i o 2 . So, (X/t) G 0i o 2 t V. Otherwise, (X/t) G 2 and X g dom(0i) n V. So, 
(X/t) G 0i o 2 t V. I 

Lemma 10.6 Let £i and i?2 be two sets of equations, and 9\ and 02 be two substitutions. If 
0i = mgu(Ei) and 02 = mgu{E 2 0{) then 0i o 2 = mgu{E\ U E 2 ). 



Proof: See 



Proof of lemma 3.1 



(pj|) : VSLDNF and SLDNF deals with negative literals in the same 



manner. Therefore, it remains to prove for the cases where the leftmost literals are positive. 
The proof has two parts. The first part corresponds to procedure-entry and the second part to 
procedure-exit . 

We first prove that if &(j,k) T — Pj ° T (j,k) T then R2 (p.^) is derived from Rl (p.0) iff 
R2' (p||) is derived from Rl' (p||) and | V ij+1 = p J+1 o t u+1a) f V l]+1 . 

Let C(j,fc)V^ = Pj o Tu^fy t Vj. . Then there is a renaming 6 such that 

(o-(j.fe) T ) o 5 = pj o T (j >k ) T (59) 
By choosing the same clause CV +1 to be resolved with both Rl and Rl', we have that R2 is 



derived from Rl iff R2' is derived from Rl' by corollary 10.4 (p. 31). Suppose that R2 were 
derived from Rl and R2' from Rl'. 

B (.ij,k)Pj T (j,k) 

= B(i jt k)(P3 ° r (j,k) T ('.' vars{B [l]tk) ) C V %1 ) 

= B {ii,k){{v(j,k) T Vy) oS) ('.' equation 59) 

= B {ii,k)V(j,k)5 ('■' vars(B( ijik) ) C Vi 3 -) (60) 

ft+l OT (j+M) t Vi j+1 
= pj+i o r^jfe) o ?7 1 Vi J+ i ('•' equation 2) 

= o r) t V ij+1 ('.' equation 1 ) 

= p j+ i o mgu{H ij+1 pj +1 ,B^.^pjTQ^)) T ('•' eguation| ) 

= pj+i omgu{H lj+1 p, J+1 ,B {l . M) (j (: j Jt) 8) ] V i]+1 ('.' equation^ (61) 
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Let 5 be the inverse of 5. 



^(3+1,1) T V ij+1 

= mgu(H ij+1 , B^. ik) a^^ip^ +1 ) T V ij+1 (V equation 6) 

= mgu(H ij+1 , B^.^o-y^SSipj+i) t V; 3+1 ('.' <W is identity) 

= mgu{H lj+1 ,(B {i ^ k) (T ihk) 5)(5o^ :j+1 )) | V ij+1 

= mgu(H ij+1 ,(B {i . tk )a(j >k )S)(6oip j+1 )) f vars(H ij+1 ) (62) 



^(j+i.i) T Vij+i — Pi+i ° T (j+i.i) T ^-tj+i by corollary 10.4 and equations pl|-|62|. This completes 
the first part of the proof. 

We now prove that if <J( J+ i, m [ lj+1 ]+i) T = Pj+i ° T then <J(j, k +i) T V i;j = 

Pj ° T (j,k+i) T Vu ■ Let (5' be a renaming such that 

V(j+l,m[ij+i}+l) T = (pj+1 °T,-, fc+ i T V, 3 . +1 ) o<5' 

and <5' be the inverse of 5'. a(j+\,m[i i+l )+i) = Pj+l ° Tj,k+i ° S' f V lj+1 . Therefore, 

#ij+i (T (j+l,mfe + i]+i) < A?+i 
= ^ J+ iPj+i' r (j : fc+i) < 5 /( / , j+i 

= H i]+1 p j+ iT (jyk) r]9S'<pj + i ('.' equation A) 

= H i]+1 p J+1 nQ5'(j) j+1 ('.' vars(C ij+1 p j+1 ) Hvars^pj) = 0) 

= B (.i jt k)Pj T (j,k)VQ$'<f>j+i ('•' equation |) (63) 



By equation 
So, 



o-(j,fc)) T V,: 3 = o r (jifc ) o (5 | (64) 



= B (ij,k){Pj ° 7j,fc ° <5 T V,-J ('■' equation 64) 

= B( ijtk )pjT jjk 5 ('.' vars(B {i .. k) ) C V b ) (65) 

Substituting equations [63]-|65| into equation Q and letting A = B(j,. tk }pjTj ;k , we have 
T 14,. _ 

= pj o Tj^ o (5 o mgu(AS, An9S' '4>j+i) T 

= pj o Tj,fc o (<5 o mgu(A5, An95' 4>j+\) | uars(A)) j Vi, ('.' lemma 

— Pj ° r j.*: ° ( TO 5 u ( j 4i An95'(j)j+i) f uars(A)) j ('.' lemma 
= pj o rj : fc o mgu(A, An95' '</>j+i) f V*. ('.' lemma 
= Pj ° Tj,fe orjo9 o 5' o (j>j +1 | Vi 3 

— Pi ° r j,*: o o T Vjj ('■' <^'j <A?'+i are renamings) 
= Pj ° T U,k+i) T Vij ('•' equation |) 



10.5) 



10.3) 



10.5) 



Proof of lemma 4.2 



(p. 15): It is sufficient to prove that Fp f C 7^(Fp f fc) for any 



ordinal fc. The proof is done by transfinite induction. 
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Basis. F P T = C {$} = 7 »(jJ) = 7 »(F P ] 0). 

Induction. Let Fp f k' C 7"(i 7 P t fc') f° r an Y fc' < fc- If fc is a limit ordinal then .Fp f fc = 
□"{Fp T k' | fc' < fc}. Therefore, ^(F p \ fc) D ^{F p ] k') for any k' < k by equation ||. By 
the induction hypothesis, 7 tt (F* t fc) D F> j fc' for any fc' < fc. So, F> | k C 7 H(Fp | fc). 



Let fc not be a limit ordinal. Let || pi^-»gi, Q\ || • ■ • | Pr»<-*9n; $n |r$ G -Pp T fc- By equation 15 



it is sufficient to prove that Pi<-»qi G £p A 0j £ [Fp f ft] ^ . for any 1 < i < n. 

There is < j < 3 such that || p 1 ^q 1 ,6 1 || • • • || p n ^q.*,d n \\ ■ $ G Fp(P> f (fc - 1)) by 
equation 10. 

Let j = 0. By equation [ll], n = 1 and pi<-»gi £ £ P A Q\ G ©p[i]- So, by equation 17, 

llPi^gi,^ || • • • || Pn^«r..«T. H-$e 7 tt (^pTfc)- 

Let j=l. By equation ^ pi<-»<?i £ £p, p 2 = <7i and || p 2 ^q 2 , #2 || • • ■ || Pn^»q n ,8 n \\ • $ € 
i^P t (fc - 1) and 6>i = unify(B qi ,a,H pi[1] ,e) ^ fail. || p 2 ^q 2 ,B 2 II • • • II Pn^»q n ,0 n \\ ■ $ g 
7 J (Fp t (fc - 1)) by the induction hypothesis. || pi<-mqi,0i || • • • || p n ^q n ,0 n \\ ■ $ G 7 s (Fj, T fc) 
by equation [l^ and the monotonicity of Fp . 

Let j = 2. By equation [l3|, G £j> and there are two stack items || qx<r-»u,a | and 

|| pi<r-»v, r\ || such that 

|| qi<r+u, a || • || Pi<-»«, ?/ || • || p 2 <-»q 2 , 6*2 || • • • || p„^»q„,0 n || • $ G Fp t (fc - 1) 

6 1 = unify(H qi[1] ,a,B PT ,ri) / fail 

|| P2<— •^2; $2 || ' ' ' || Pn^-*qn? @n || ' $ G 7 tt (Fp t (fc - 1)) by the induction hypothesis. Therefore, 
|| pi<-»qi,6i || • • • || p n <-»?n) #n II ' $ G 7 tt (Fp j fc) by equation [l^ and the monotonicity of Fp. 

Let j = 3. By equation [l4|, p\<-»qi G £j>, p 2 = <?i, and || p 2 ^»q 2 ,9 2 | • • • || p n <~*q n , On || • S G 
Fp t (fc — 1). By the induction hypothesis, || P2<-»<i , 2 I ^2 || • • • || 2?n<-«Sn> #n || • $ G 7"(Fp | 
(fc — 1)). Therefore, || pi<-*qi,6i \\ ■ ■ ■ || p n *-»Qn>6 n \\ ■ $ G 7"(Fp | fc) by by equation |2(] and the 
monotonicity of Fp. 

Therefore, Fp 1 k C 7"(F p f fc) for any ordinal fc. I 



PROOF of theorem (p|l7|): (C4) implies that F P is monotonic and therefore IfpFp 
exists. By theorem [O], it suffices to prove that, for any X b G V b , F p ■ j b (X b ) c" 7 b • Fppf b ). 

Let <T G [Fp • 7 b (^ b )] p ^. g - We need to prove a G [ 7 b • Fp(X b )] p ^ g . Let G £ P for some 
• ./ • ::. 

Let j = 0. By equation 0, <r G 7v p[1] (^[i])- By equation || a G 7Vp[1] ([Fp(X b )] p ^) = 
[7 b -Fp(X%^. 

Let j = 1. By equation |l8|, there is u G Afp such that q<-»u G fp and 
cr G unify^{B q , [l^{X b )] q4r _ 9u , H p [i], {e}). By equations |l8| and |2l|, C3 and the monotonicity 
of function unify^ in its fourth argument, 

cr G unify* {B q ,j V q[i] (X b q ^ u ), H p[1] , 7Vp[i] (e Vp[1] )) 
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So, by equations ^T] and ^3] and the monotonicity of 7 y , 

° e 7v p[1] ([^(^ b )] p ^ g ) 
C 7v p[11 ([^(^ b )] p ^ 9 ) 
= [7 b -4(^)U 9 
Let j = 2. There are u,v £ Afp such that p~<-»v, q*-»u G £p and 

a G ™/y»(H g[1] , [7 b (^ b )]^ ;J B P -, [7 b (* b )] P -^J 
by equation [[{]. By equations |l] and [24|, 

a G unifyt( H q[1] , 7 Vg[i] (J^ J, -Bp-, 7y,, tl] P^-^-. J) 



- 7v p[1] ■™/!/v, |11 ,v p|11 (fl 9 [i].^». 5 P -> 1 r H») 
C 7v p[1] ([^(^ b )] p ^) 
= [f^P^U 

Let j = 3. By equation |2(i|, there is u G Np such that a G [7 b (^ b )] g ^,„- By equations ||l| 
and |25|, 

= 7v 9[ ij (^«<-»«) 

C 7v 9[1] ([^(^ b )] p ^ g ) 

= 7y p[1] ([^(^ b )] p ^ g ) 

= h b -Fp(x%^ q 



PROOF of lemma |67] (p. 19): It suffices to prove that, for any p<-*q G £p and any 1° G P°, 



• 7 (^ )] p ^, 9 = fr'-W)^,- By equation |l 



Vp^g G fp.([ 7 °(^°)] p ^ 9 = X;) (66) 



Jp<-»<3 

Let p^q G £0 . [F> • ^(X% = p[1] = [7° • by equations |§ || and @ 

Let p<-«? G £p. By equations p3|, |3J3 and E36], 



Let p<-»g G £p. By equation E4, p4 and pq, 



tfp ' 7°PH] P ^ = unify Vqm tVp[i] (H q[1] , X», B p .,x;_) = • 
Let G £p. By equations |25|, |35| and ^36], 

[Fp ■ i°{x«)] p ^ p - = x;_ = [ 7 * • 

Therefore, [F P ■ 7°(^°)] p ^ g = [7* 
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